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p ^ , Summary. This paper concentrates on asymptotic properties of determinants of some ran- 
| dom symmetric matrices. If -B njr isanxr rectangular matrix and B' nr its transpose, we study 
| det{B' n r B n ^ r ) when n,r tends to infinity with r/n — > c £ (0,1). The r column vectors of B n ^ r 
are chosen independently, with common distribution v n . The Wishart ensemble corresponds to 
v n = N(Q,I n ), the standard normal distribution. We call uniform Gram ensemble the ensem- 
ble corresponding to v n = o~ n , the uniform distribution on the unit sphere § n _i. In the Wishart 
ensemble, a well known Bartlett's theorem decomposes the above determinant into a product of 
£^ | chi-square variables. The same holds in the uniform Gram ensemble. This allows us to study the 
O ' process {Mogdet (B' n \ nt \B n ,[ n tj)j^ £ [0)1]} an d its asymptotic behavior as n — > oo: a.s. conver- 
^ ■ gence, fluctuations, large deviations. We connect the results for marginals (fixed t) with those 
\f} . obtained by the spectral method. 

o 
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£5 1 Introduction 

_ _ i 

For n,r6W such that r < n, let M n r (R) be the set of n x r matrices with real entries. 
A matrix B e M nr (R) consists in r column vectors £>i, • • • , b r of R n . We denote by B' its 
transpose, so that B'B e M r!r is symmetric. We provide R n with the usual Euclidean norm. 

In 1893, Hadamard [27] proved that 

det(B'B) < ||M 2 ---||M 2 

with equality if and only if b%, • • • ,b r are orthogonal. That means that the volume (or r- 
content) of the parallelotope built from b%, • • • , h T is maximal when the vectors are orthogonal. 
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Consequently, the quantity 



h(R\ det{B ' B) m 

{ ) " ¥¥ 77 lM 2 { ) 

is usually called the Hadamard ratio (cf. [Ej); in the basis reduction problem ([I],[H],[E]), the 
quantity 1/ ^/h(B) is called the orthogonality defect. Some papers (P,[E]) are concerned 
with the tightness of the bound h(B) < 1 when B is random and n = r. Writing B n ^ r 
instead of B to stress on dimensions, it is interesting to study the asymptotic behavior of 
the sequence of random variables h(B n r ), in particular when n, r — > oo with r /n — > c G [0, 1]. 

We consider independent random vectors b{,i = 1, • • • , r with the same distribution v n 
in W 1 . It seems natural to choose v n = a n , the uniform distribution on the unit sphere § n _i. 
The corresponding ensemble for B is called Uniform Spherical Ensemble in ^H] ■ The matrix 
ensemble for B'B is called the Gram ensemble in [T3] since B'B is the Gram matrix built 
from vectors fej's. To stress on the distribution, we call it uniform Gram ensemble. 

More generally, if v n is isotropic, (i.e. ^n({0}) = and v n invariant by rotation), it is 
well known that b\ := &i/||&i|| is a n distributed and independent of ||6i||. Denoting by B 
the matrix of unitary vectors, we see that B'B is in the uniform Gram ensemble. It makes 
possible to study det B'B in its own, since the decomposition in independent factors 

r 

det{B'B) = det{B'B) x JJ \\b t \\ 2 (2) 

i=l 

reduces this case to the previous one if the distribution of ||&i|| 2 is well behaved. 

The most important example is the Gaussian one with u n = jV(0; J n ): all the entries of 
B are i.i.d. jV(0; 1) and B'B is in the Wishart ensemble. Moreover ||&i|| 2 is Xn distributed. 
Our paper is concerned essentially with these two cases. 

We introduce a probability space on which all uniform Gram and Wishart matrices are 
defined simultaneously. It is just the infinite product space generated by a double infinite 
sequence of i.i.d. jV(0; 1) variables {bij}°° =1 . Then we take B nT = {bij,i = l,---r,j = 

l,---r} and omitting the dimension index n, we set bij = 6i,j/(X^=i ^kj) 1 ^ 2 anc ^ B n ,r = 

{h,j,i = 1, • • -r,j = 1,-- -r}. 

In Section EJ we recall some known results. Using the classical QR decomposition of B 
with Q e M n , r orthogonal and R G M T , r uppertriangular ([TU]). we get 



det{B' n>r B n>r ) = l[R 



2 

i 



In the Wishart case, the variables R]j,j = 1, • ' * i r are independent and \ 2 distributed 
with respective parameters n — j + l,j = 1, • • • ,r. This result is known as the celebrated 
Bartlett decomposition. In the Gram case, the corresponding variables R 2 j,j = 2, ••• ,r 
are independent and beta distributed with respective parameters ( n ~i + , ^r)- Therefore we 
will consider {log det(B' nr B nir ),r = !,••• , n} and its "tilde" version as triangular arrays and 
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prove pathwise 1 results for the sequence of processes |~logdet(.B^ |_ nt j-6n,Lnt|)> t G [0, 1] j. 
In Sectional we present the spectral approach. It starts from 

r 

logdet(i?; irJ B„, r ) = ^logA( fc ) , 
fc=i 

where A^r, k = 1, • ■ ■ , r are the (real) eigenvalues of B' n r B n ^ r . We may take advantage of 
known results (recalled in Section EJ) on the convergence of the empirical spectral distribu- 
tion (ESD) to the Marcenko-Pastur distribution as r/n —>■ c G (0, 1) (jSH] for the Wishart 
ensemble and [SU], [HI] for the uniform Gram ensemble). This allows in the following sections 
to recover results for marginals (only) and in the Wishart case to discover fruitful connection 
between the two methods. 

In Section 01 we study the Gram ensemble and set 

34,r = B' n r B n , r , T nir = log det y ntr . 

We state the a.s. convergence of {^T n ^ nt ^, t G [0, 1]} ( Theorem I4.2j) . the weak convergence 
of fluctuations (Theorem 14 .3|) and a large deviation principle (Theorem 14.41 and Theorem 

IPD . 

In Section|31 we study the Wishart ensemble. Since E||&i|| 2 = n, it is natural to normalize 
Wishart matrices and set 

X nr = —B' B n r , c n r = log det X n r . (3) 

n 

The asymptotic behavior of the process {-S^^j, t G [0, 1]} is easily deduced from (J2J and 
the above results in the uniform Gram case. (Of course, it is also possible to use the Bartlett 
decomposition). 

Section |U1 is devoted to some remarks about extensions to matrices with entries with 
Gaussian entries in other fields (complex, quaternionic), and even with non Gaussian entries. 
An extension to the Jacobi ensemble will be considered in a forthcoming paper. 

The proofs of results are located in Section [7| and |S1 

All along the paper we use the function £(x) = logT(x), and its derivative ^ = £' which 
is the digamma function. Some useful properties of I and ^ are given in Appendix. We use 
also the following functions: 

J[u) = ulogu — u+1, for u > 
J{u) = 1, for u = 
J{u) = +oo, for u < , 

and for t > 0, 

r* t 2 

F(t) = / J(u) du=-\ogt 
Jo ^ 

1 We stress that this study is pathwise in the parameter t - 
Wishart processes defined from Brownian matrices. 
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r/n and not in the "time" parameter as in 
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Recall also that for a > 0, the \ 2 a distribution has density 



— — 1 
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(% > 0) 



2 fr(f) 

and that, for a > 0, /? > the beta(a, /3) distribution has density 

r(a + /3) j 



T(a)r(/3) 



^(l-x)^ 1 (x>0) 



Let us end this introduction with some comments. Wishart matrices were first introduced 
in multivariate statistical analysis as sample covariance matrices : B G M n>r is a data 
matrix where r is the number of variates and n is the sample size. Then det(B'B) is Wilks' 
generalized variance (up to a transformation). It is used to build tests on the covariance 
matrix (SH])- In contemporary multivariate analysis, it is common to consider large r and 
large n (see examples in [31]) , although it may seem non standard to increase the number of 
variables for a given size of the sample. Besides, in stochastic geometry it seems interesting 
to describe the evolution of the r-content of a random r-parallelotope as r increases. In 
quantum dynamics, the uniform Gram ensemble is introduced by De Conck et al. in [21] 
and they called c (the limit oir/n) a time-parameter, although they assumed it fixed. 



2 Decompositions 

This section consists in notation and recalls. The key point is a decomposition of deter- 
minants in products and its consequence for random Gaussian matrices. 



2.1 Some linear algebra 

Every matrix B G M nr may be decomposed (see jlOj ) in a product B = QR of an 
uppertriangular matrix R G M r , r (M) and an orthogonal matrix Q G M n>r (lR). If the vectors 
bi, i = 1, ■ • • ,r are linearly independent, the decomposition is unique if we force diagonal 
elements of R to be positive. By the Gram-Schmidt method, we set c\ = b\ and for 2 < j < r 

- h - Cfc ' j 

°i ~ °i 2-^i II r II 2 ° k ' 

k=i 11 fc " 

and then build the orthonormal system: 




1 < j < r . 



This yields 

i-i 

bj =< fj, bj > fj + < fk, bj > fk (4) 

k=l 
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Now Q consists in fx, • ■ ■ , f r and R is given by 

R j,j = \\ c j II =< fj, h 3 >> l<j<r, (5) 
and for 2 < j < r and k < j — 1: 



i4j =< f k , bj >= <C ^\ > • (6) 



From (jlj) we deduce 



iim 2 = ^L- + X>^i 2 - (7) 

fe=i 

We can write &j = ||&i||6j with 6j G § n -i, so that /i, • - • , / r depend only upon &i, - - • , o r . We 
have 

R jj = W b j\\ 2R2 jj > Rjj :=< fj'bj > 2 ■ (8) 
Since i? is upper triangular and £>'!? = we get easily 

det(5'5) = n^-, (9) 

3=1 

and from (Q) and (jSJ) 

r 

= det{B'B) = Yl R) 3 . (10) 
(It is clear, of course that h(B) < 1, as Hadamard noticed). 

2.2 Random Gaussian Matrices and Bartlett's decomposition 

In the sequel, we study models of random matrices in which all entries are independent 
and A/"(0, 1) distributed. 

We are in the situation of Section I2~T1 It is clear that for every j = 1, • • ■ , r, 

Span {bx, ■ ■ ■ , b r } = Span {c x , • • ■ , c r } = Span • • • , f r } , 

and Rj j is a measurable function of (6j, • • ■ , bj) thanks to (0) and 

Proposition 2.1 T/ - 1 < r < n, the random variables R? -, j = 1, • ■ ■ ,r are indepen- 

dent and 

r>2 £ 2 

-"'jj An-(j-l) j 

where = stands for equality in distribution. 



2) If 2 < r < n, the random variables •, j = 2, • ■ ■ , r are independent and 

52 © . . fn-j + 1 j - 1\ 
= feeta ^ , J . 

The first claim is the celebrated Bartlett decomposition ( 9\). It is quoted in many books 
and articles in particular [U] pp. 170-172, J3§] pp. 99 th. 3.2.14, [33]. The second claim 
comes from 1) and (j5ji . For the sake of completeness, we give here the proof of 1), with the 
so-called "random orthogonal transformation", which may be found in [33J. 

Proof: 1) Let us fix 2 < j < r, and condition upon (b\, ■ ■ ■ , bj-i). From equation (JUJ) we 
have TZj := {Rij, ■ ■ ■ ,R^-i)j) = Fbj where F = (fx, ■ ■ ■ , fj-i) G M n j_i is a (known) 
orthogonal matrix. The Cochran theorem and (J7J) imply that TZj is jV(0; Ij-i) distributed, 
that ||7£.,-|| 2 = Yuk=i \Rk,j\ 2 is Xj-i distributed and that R 2 j = ||oj|| 2 — ||^i|| 2 is independent of 
||7£j|| 2 and Xn-u-i) distributed. Since in all the above statements, the conditioning variables 
(bx, ■ ■■ ,bj-x) did not appear, these statements are true unconditionally. In particular, R 2 j 
is independent of (b\, ■ ■ ■ This yields that all the variables R\p j = 1, ••• ,r are 

independent. 

2) From (jHJ) and the previous remarks, we see that the variables R 2 j are also independent. 
To get the distributions, recall that if U = Xa an d V = \\ are independent variables, then 
U/(U + V) = beta(a/2, 6/2). ■ 



To consider the asymptotic behavior of uniform Gram and Wishart determinants in a 
dynamic (or pathwise) way, let us give some notation. 
Set T n = 0, T n l = and for 2 < r < n 

r 

r n , r = ^io g i^. (ii) 



k=2 



which provides a first triangular array. Besides, from (JHJ) and (JHJ), for r = 1, 



where 



V :=X>WE, r=l,..,„. 

k=l 



n 



r R 2 

H n , r = vi og ^. (12) 

z — 4 n 

3=1 

provides a second triangular array. Actually, in that case, we have also, from (JHJ) and 

"n,r T n r -|- , T I, • • • , Tfl , (13) 



In this auxiliary triangular array, the independent variables (||&fc|| 2 , k = 1, • • ■ , nj are inde- 
pendent of (S„ jr ,r = 1, • • • ,n) and ||6fc|| 2 = x\ n )- The three processes are denoted by 

Y n (£) := T nj |_ ni J , ^(t) = £. n ,|ni| j Sn(t) — Sn,[nt\ > t G [0, 1] . 
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3 The spectral method 

Beside the above "decomposition method" we will use the spectral approach which we 
describe now. 

Let \n}r, k = 1, • ■ ■ ,r be the (real) eigenvalues of X nT in the regime n,r —>■ oo with 
r/n — > c < 1 fixed. We set 



r 

fc=l 



- 5 X (k) 

f ( * A n,r 

fc=i 

the empirical spectral distribution (ESD). In particular 

/i r i 
log x dfx n<r (x) = - log AjJJ = - log det Af n>r . 
r fe=i r 

For c > and ex > 0, let tt^ 2 be the probability distribution on R defined by 

((x - a 2 a(c))(a 2 b(c) - x)) 1/2 
n c a2 (dx) = (l-c- 1 ) + 5 (dx) + ± — J -^dx, (14) 

where So is the Dirac mass in 0, x + = max(x, 0) and 

a {c) = (1 - yfcf , b(c) = (1 + V~c) 2 . (15) 

It is called the Mar cenko- Past ur distribution with ratio index c and scale index a 2 (j7j p. 621). 

It is well known ([HHI, [Zj section 2.1.2) that as n,r — > oo with r/n — > c G (0,oo), 
the family of empirical spectral distributions (fi n ,r) converges a.s. weakly to the Marcenko- 
Pastur distribution 7rJ. If we replaced the common law 7V(0; 1) by Af(Q, a 2 ) then the limiting 
distribution would be tt^ 2 ■ 

Recently, De Cock et al. ([12]) and Jiang ([20]) proved that the same result holds true 
in the uniform Gram ensemble. If Xn}--, k = 1, ■ ■ ■ , r be the (real) eigenvalues of y n ^ r and 



fc=l 



then, as n — > oo and r/n-»c£ (0, oo), the family (/2 njr ) converges a.s. to 7rJ. 

In both cases (uniform Gram and Wishart) we examine in Section 0] and the connection 
between the "decomposition" method and the spectral method, at the level of marginals. 



4 Determinants in the uniform Gram ensemble 

The proofs of the results of this Section are in Section [7| The subscript or superscript 
G (resp. W) for the limiting quantities refers to the uniform Gram ensemble (resp. the 
Wishart ensemble). 

Let us notice that in the paper |T0J , a decomposition in product of beta variables for a 
completely different problem leads to similar results. 
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4.1 Two first moments and almost sure convergence 
Proposition 4.1 For the mean, we have 



lim sup 



-ET n>p + j(l 
n \ 



0, 



(16) 



and actually, 

VtG[0,l), (ET n (t)+nj(l-^- 



ET, 



n H — log n 
2 & 



d G (t) :=t + -log(l-t) 
-7-log2 + 3 



(17) 
(18) 



where 7 is tae Euler constant. 
For the variance, we have 



Vt G [0, 1) , Far T n (t) -> tr^(t) := -2 log(l -t)-2t 



( Far T„, n - 2 log n) 
Theorem 4.2 Almost surely, 

T n (t) 



lim sup 

n te[o,i] 



n 



S7 + 7T^ 

4 



+ J(l-t) 



(19) 
(20) 



The formulae ()18j) and (|2~Uj) are due to Abbott and Mulders (lemmas 4.2 and 4.4 in pQ), 
using a variant of the decomposition method. 

If we want to use the spectral method (with fixed t — c < 1) we may start with 



-rjc) 



[nc\ 



logx d/x niLnc j(x) 



n n 

use the weak convergence of '\x n ,Ync\ towards t\\, (see Sectional). To conclude that 



lim J logx dn nj[nci (x) = J logxd^i 



x 



(21) 



an additional control is necessary, since x t— > logx is not bounded. In [3U|, Jiang proved 
recently that the largest and the smallest eigenvalue of y nj1 . converge a.s. , as r /n c < 1 
to 6(c) < 00 and a(c) > respectively (remember the definitions of a and b in (jT5Jl ). So, 
(12*TJ) is true. Moreover, it is known Q32J p. 31 and |H] p. 596-597) that : 



c J logx dnl(x) 



^ lot 



X 



a(c) 



2nx 



y/ (x — a(c))(b(c) — x) dx 



(22) 



= (c - 1) log(l - c) - c = - J(l - c) 
This matches with the result of Theorem 14.21 
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4.2 Fluctuations 

Let D T = {v G E)([0,T]) : v(0) = 0} and D = {v G D([0, 1)) : v(0) = 0} the set of 
cadlag functions on [0, T] and [0, 1), respectively, starting from 0. 

Theorem 4.3 1. Let for n > 1 

^(*):=T B (0+n^(l-M) , i£[0,l). 

JTien as n — ► oo 

(^(t); t€[0,l))^(F t ; te[0,l)), (23) 
where Y is the (Gaussian) diffusion solution of the stochastic differential equation : 



1 — 2t It 

dY t = - rdt+J dB t , (24) 

2(1 -t) VI — * 

wzt/i Yo = 0, B is a standard Brownian motion and stands for the weak convergence 
of distributions in D provided with the Skorokhod topology. 

2. Let 

^, > T„ in + n+|logn 



^(1) 



V21ogn 



Then as n —>■ oo, rj^{l) =>- N where N is a standard normal variable independent of 
B ; (and ^> stands for the weak convergence of distribution in 1R). 



4.3 Large deviations 

All along this section, as in Section 15.31 and in the proof Sections 17.41 and I8.2j we use the 
notation of Dembo-Zeitouni ^H]- In particular we write LDP for Large Deviation Principle. 
The reader may have some interest in consulting jTHj where a similar method is used for a 
different model, but here we use a slightly different topology to be able to catch the marginals 
in T. 

For T < 1, let M.t be the set of signed measures on [0, T] and let A4< be the set of 
measures whose support is a compact subset of [0, 1). 

We provide D with the weakened topology cr(D,A4<). So, D is the projective limit of 
the family, indexed by T < 1 of topological spaces (Dt, ct(Dt, M.t))- 

Let Ve (resp. V r ) be the space of left (resp. right) continuous H-valued functions with 
bounded variations. We put a superscript T to specify the functions on [0,T]. There is a 
one-to-one correspondence between and M.t '■ 

- for any v G V^, there exists a unique /i G M.t such that v = /x([0, •]); we denote it by v 

- for any /i G Air, v = /u([0, ■]) stands in V r . 
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For the following statement, we need some notation. Let H be the entropy function : 

x 1 — x 

H(x\p) = x log — h (1 — x) log , (25) 

p 1 — p 

and set 2 

Lf(t,y) = iH(l-t|e»)tf(y|(-oo l O)) , Lf(t,y) = -1(1 - t)y <%|(-oc, 0)) . (26) 

Theorem 4.4 T/ie sequence T n } n satisfies a LDP in (D,a(D,Ji4 < )) at scale n 2 with 
good rate function given for v e D by: 

I^(v) = | o i L„ (t, ^(t)) dt + | o ^ Lf (t, ^(t)) dn{t) if v eV r , (27) 

where v = v a + v s is the Lebesgue decomposition of the measure v G *M([0, 1)) in absolutely 
continuous and singular parts with respect to the Lebesgue measure and \l is any bounded 
positive measure dominating v s . If v (fc V r , then I^^(v) = oo. 

That means, roughly speaking, that 

P(T n ~ v) « e~ n2l ^ {v) . 

The proof, in Section EU needs several steps. First we show that T n } satisfies a LDP in 
M.T provided with the topology <t(.Mt, Vg). Then we carry the LDP to (D T ,o-(D T ,M T )) 
with good rate function: 

feM - j m X («. » * + J [oTi Z (*. I W) *W ■ (28) 

To end the proof we apply the Dawson-Gartner theorem on projective limits (^3] Theorem 
4.6.1, see also jSSj Proposition A2). 

Let us notice that /g T ] vanishes only for ^(t) = log(l — t) and ^f(t) = (essentially) 
i.e. for v(t) = — J{\ — t), which is consistent with the result of Theorem 14.21 

The LDP for marginals is given in the following theorem, where a rate function with 
afline part appears. 

Theorem 4.5 For every T < 1, the sequence T n } n satisfies a LDP in IR at scale n 2 
with good rate function denoted by 

!«(£)= inf{/g T] (<;) ; <T)=£}. (29) 

2 we set S(y\A) = if y € A and = oo if y ^ A 
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1. If £ > —T the equation 

J(l + 29) — J(l — T + 20)—T log(l + 29) = £ , (30) 
has a unique solution, and we have 

= + 1*7(1 + 2(9) (31) 



If £ < —T, we have 



+ - (F(l) - F(l — T) — F(l + 20) + F(l - T + 20)) 
2 



I?(0 = It(-T)- 1 —^(£ + T). (32) 



5 Determinants in the Wishart ensemble 

5.1 Introduction 

Three ways are possible to study the asymptotic behavior of the determinant of 

n 

a) The spectral approach if we are intereted only in marginals (r/n —>■ c fixed) . 

b) The Bartlett's decomposition method for a dynamical study. The representation (|12|) 
leads to results similar to those of the above section. Let us remark that at the level of 
marginals, (jHJ) gives the Mellin transform: 

E(det * r , n r = g" ^/^ , ^ > -^-^ 

where 

r r (a) = +'^~ 1 )/ 4 r(a)r(a - 1) ■ • -r(a - r -^-) 

(see for instance j3H] Theorem 1 p. 347). This yields the density of detA^n (cf. [HZ], [U] 
formula 2.4, when r = n), in which the Meijer function is involved. 

c) Actually, we prefer to establish these results from the representation (fT3*j) which we 
recall here: 



log det X niT = log det 3V + S n 

IL ||2 

S n ,r = 5Z l0F 



n 

k=l 



for r = 1, • • • , n, where the variables ||&fc|| 2 , k = 1, ■ ■ ■ ,n are independent, xi distributed, 
and independent of (log det y n<r = Y n r , r = 1, • • • , n). 

We state also connections with known results deduced from the spectral approach. The 
proofs are in Section 
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5.2 Almost sure convergence and fluctuations 

By extension of the study in Section R~T1 (method b) above), we get the following. 
Proposition 5.1 For the mean we have 



lim sup 



— EH np + J(l — —] 
n n 







Moreover, as n — ► oo 

VtG[0,l) ES n , LntJ +7ij(l-M) 



E5 n ,„ + n+ - log ra 



d w (t) = -log(l-t) 
7 + log 2 - 1 



For i/ie variance we have 



W G [0, 1) FarH n>LntJ -> a^(t) := -2 log(l - t) 



VarS„ )n - 2 log n 
Theorem 5.2 Almost surely, 



27 + 7T 2 + 



lim sup 

n te[o,i] 



n 



Theorem 5.3 Let 



+ J{l-t) 



r/f(t) := ~ [nti , n + nj(l - ^) , * G [0, 1] , 



0. 



T/ien as n — > 00 



S„ >n + n + I log n 
V 2 logri 



( V W(t); te [0,1)) =► (X t , t€ [0,1)) 



TO 



AT 



where X is the Gaussian diffusion solution of the stochastic differential equation : 

1 



2(1 -t) 



dt + 



l-t 



dB 



t ■ 



(33) 



(34) 



with X = 0, where B is a standard Brownian motion and N is a standard normal variable 
independent o/B. 
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With the direct method b), Jonsson proved (}3*3*j) for fixed t (i.e. convergence in distribu- 
tion of the marginal) and deduced a convergence in probability of i nt j towards —J{\ — t) 
(Theorem 5.1 p. 29 and Corollary 5.1 p. 30 of [3*2"]). 

Let us explain now the results which may be obtained by the spectral method a). 

It is well known that the empirical spectral distribution of X n ^ r converges weakly a.s. when 
r/n — > c towards "k\. Moreover if c < 1 the largest (resp. smallest) eigenvalue converges a.s. 
to 6(c) (resp. to a(c)). For comments on these results and references, one may consult [7j 
sections 2.1.2 and 2.2.2. 

In our context, this implies easily that 

11 r f f 

— E n r = — logdet X n r = — / logx dfjL n Jx) — > c / log x dn\ (x) (35) 
n ' n n J J 

a.s. when r/n -> c 6 (0, 1). We already see in (|2*2*Jl the value of this integral. Claim (|3~5j) is 
then consistent with Theorem 14.21 

The fluctuations were studied recently by Bai and Silverstein |8] (in the case of complex 
entries). They obtained 

logdetA^ + n^l--) =>JV(log(l-c) ; -21og(l-c)) , 
which is consistent with (the marginal version of) (|33j). 



5.3 Large deviations 

Again, the three routes are possible to tackle the problem of large deviations for deter- 
minant of Wishart matrices. A direct method would use the cumulant generating function 
from Section l2~2l and would meet computations similar to those seen in the Gram case. 

To avoid repetitions, we use the b) method, drawing benefit from an auxiliary study of 

Lemma 5.4 The sequence S n (t),t G [0, 1)} satisfies a LDP in (D,a(D,M.<)) at scale 
n 2 with good rate function 

^ w =L £f (»* + L if (»' wt) 



where 



■s/..\ _ 1 r„v i\ r8f..\ y . 



L b a (y) = ^(ey~y-1) , L^) = -|<%| (-oo, 0)) , (36) 
and fi is any measure dominating dv s . 

Let us stress that the instantaneous rate functions are time homogeneous and then we may 
write [0, 1] instead of [0, 1). 
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Theorem 5.5 The sequence ^ r E ny \ nt \ ) t e [0,1)} satisfies a LDP in (D,o~(D, M.<)), at 
scale n 2 with good rate function 



[0,1) \ at j ^[ 0j i) V 



= / C ( t, ^(t))dt+ I LY ( t, -^(t) ) d»{t) (37) 



where 



LY(t,y) = \{e y -l)-\{l-t)y+ l ~J{l-t) 
LY(t,y) = -i(l-%%|(-oo,0)). (38) 
and fi is any measure dominating dv s . 

Let us notice that the restriction of to [0, T\ vanishes only for ^(t) = log(l — t) and 
^p-(t) =0 (essentially) i.e. for v (t) = — J{\ — t), which agrees with the result of Theorem 
"l 

For marginals, we give (without proof) the exact analogue of Theorem 14.51 



Theorem 5.6 For every T < 1, the sequence ^-E n ^ nT jj satisfies a LDP in 1R at scale 
with good rate function denoted by /jf . 



n 2 



1. If £ > £ T := J{T) — 1 the equation 

J(l + 20)-J(l-T + 26)=i. (39) 
has a unique solution, and we have 

It iC) = 0£ + \ (^(1) - ^(1 - T ) - F i l + 10) +F(1-T + 29)) . (40) 



2. we have 



lY(0 = I T V (T) + ^—^^ T -0- 



Let us comment the spectral approach. Hiai et Petz [2Bj proved that if n — > oo, r/n — > 
c < 1, then {/i n>r } satisfies a LDP at scale n 2 with some explicit good rate function 1^ 
given below in ()411 14*21 l4*3*j) . If the contraction \i \— > J log a; dfj,(x) was continuous, we would 
claim that logdet X n ^ nT ^} n satisfies a LDP with good rate function 



/tf(£) = inf |4 sp) (/i) ;T jlogx d^x) = ^ 



Actually 

I^)^):=- C \^) + C - J (x - {1 - c)\ogx)dn{x) + B{c) (41) 
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where 

s (aO - = J J lo & \ x ~ y\ d K x ) d Ky) ( 42 ) 

is the so-called logarithmic entropy and for c G (0, 1) 

B(c) = -i(3c-c 2 logc+(l-c) 2 log(l-c)) . (43) 

We do not know if the contraction \i \— > J log a; dfi(x) works, although not continuous. How- 
ever we will prove the following result, where for u G R we set .A(tt) = {/i : J log x dfi(x) = u}. 

Proposition 5.7 For £ > £t and 9 solution of let a 2 = 1 + 29. Then the infimum of 
l!^ p \jj) on A{^/T) is uniquely achieved for and 

17(0 = 4 SP) (^ 2 ) = inf{4 Sp) ()"); e .4(£/T)} . (44) 



Remark 5.8 1. T/ie endpoint is £r — J{T) — 1, corresponding to 1+29 = T, i.e. a 2 = T. 

2. For £ < C,t we do not know what happens. We can imagine that the infimum in 
has a solution in some extended space. 



6 Extensions 

We examine now some possible extensions of the previous results. We focus on assump- 
tions on the entries of the matrix B. We keep the same asymptotics and notation than in 
above sections. 

Let us mention that the methodology of the present paper will be applied to the Jacobi 
ensemble in a forthcoming paper. 

6.1 Independent Gaussian non real entries 

In the previous sections, entries of B were real numbers. We consider now entries in C 
and in H, the set of real quaternions. Recall that an element of 1H may be viewed as a 2 x 2 
matrix of the form 




where z and w are complex numbers. Its dual (or conjugate) is 
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denoted by Zjk for j — 1, ■ • • , n and k — 1, • • • , r. 

Let B be a n x r random matrix, and suppose the entries of i? are determined by a 
parameter /3 — 1,2 or 4. These entries are i.i.d. random variables H, C or H valued with 
Gaussian densities 

in the three cases f3 — 1, 2 and 4 respectively. If 5^ denotes the transpose of the conjugate 
of B, then the r x r matrix X = B^B belongs to the Laguerre orthogonal (resp. unitary, 
resp. symplectic) ensemble denoted LOE (resp. LUE, resp. LSE). 
Two main features of the LOE are shared by the LUE and LSE. 

• The Barlett decomposition still holds. For (3 = 2 the references are for instance |23] . 
[2n| (see also [22] Proposition 2.12 and 2.13). By the same argument as in section I2~2*l the 
random variables (Rk,j, k = 1, ■ ■ ■ , j — 1) are complex i.i.d. normal of variance 1, hence 

2|i4j| 2 = xi- Since 2||6 3 -|| 2 = x\ n we conclude that 2i? 2 j = xtn-2(j-i)- For $ = ^> tne 

reference is [22] Exercise 2.5.5. We conclude that 4i? 2 J - = xl n -4,(j-i)- 

A pathwise study of log det X in the cases (3 = 2 or 4 needs only slight modifications of 
arguments and would lead to results very similar to those of Section 

• The spectral approach is built on the probability density of the eigenvalues Xj,j = 
1, • • • , r of X which is proportional to 

3=1 l<j<k<m 

Convergence of the ESD is known not only for (3 = 2, 4 but for every (3 > (see for instance 
the Dumitriu thesis ^H] Theorem 6.5.1). 

Besides, the large deviations treated in [2E] are stated for the (real) Wishart ensemble, 
but of course are available in the general case with slight modifications since everything rests 
on their Theorem 1. 



6.2 Independent isotropic columns 

We keep independence of vectors b\,--- ,b n but assume only isotropy (in MJ 1 ) of their 
common distribution v n . The polar decomposition allows to obtain similar results as in 
Section El under convenient assumptions on the radial distribution. Let e n = log||6i|| 2 — 
logE||6i|| 2 (remember that we omit the dimension index n). 

To get convergence and fluctuations it is enough to assume 

nEe n — > ai , nVare n — > a 2 , nK(e n — Ee n ) 4 — > . (45) 

To get large deviations, it would be sufficient to assume that \ Yuk=i fog^exp (mp(k/n)e n ) 
has a limit for some convenient functions (p. 

Notice that in [3] and [S], the authors use the uniform distribution in the unit ball, so 
that the distribution of ||&i|| 2 is beta(|, l) and (}4*5|) is satisfied with a\ = —2, a 2 = 0. Here 
the contribution of the radial part is roughly "deterministic" since E||&i|| 2 is bounded. 
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6.3 Independent identically distributed (non Gaussian) entries 



If we restrict ourselves to marginals only, we may leave the Gaussian world. Let us assume 
i.i.d. (real) entries with finite variance. In (|32j. [4*3*]). the authors proved the convergence of 
the spectral distribution, using Stieljes' transform ((HE!)- In |H| Bai and Silverstein assumed 
E&fi = 3 (real entries) or = and Efe^ = 2 (complex entries), and proved a central limit 
theorem for linear statistics, with the meaningful example of logarithm of the determinant. 

The Bartlett's decomposition is not possible in the general case. Nevertheless, a product 
formula for the determinant is well known (see for example Lemma 3.1 p. 9 and formula 
4.3 p. 15 in [23J) but nothing can be said about the distribution of the components of the 
product. 

Moreover, using again the norming of column vectors as in previous sections, we may 
define B with 



A slight modification of this matrix is used in multivariate analysis to test that variates 
are uncorrelated. The matrix 3^ = B'B is a Gram matrix, built from independent vectors, 
identically distributed and living on S n _i. In this context, Jiang (jSDl) recently proved the 
convergence of the ESD to 7r£ and also the convergence of the extreme eigenvalues. It is then 
easy to deduce the convergence of the normalized logarithm of the determinant. 

This model is clearly an extension of the uniform Gram ensemble, for which De Conck 
et al. (|2Tj) proved the convergence of the ESD to 7rJ with an independent method. 

6.4 Independent isotropic rows 

We keep independence of rows of the random matrix B and assume that they are iden- 
tically distributed with an isotropic distribution on W . Actually in the data matrix B n>r , 
the index n is, as previously, the size of the sample and r is the number of variates. In [Hj . 
Yin and Krishnaiah proved the convergence of the ESD of X n r = \B' n r B n ^ r but the limiting 
distribution was not known. Actually, when the underlying distribution is uniform on §> r _i 
we can identify the limiting distribution from the result of Jiang 30 or De Cock et al. [TB*] . 

We set C — B' and then C'C is in the uniform Gram ensemble (in M nn ). The eigenvalues 
of C'C are (except with multiplicity n — r) the same as those of CC = B'B. If /x* r is the 
ESD of CC, the ESD of C'C is then 




fJ"n,r 



~<r + (1 - ~)So 



If r /n — > c < 1, hence n/r — > 1/c > 1 , 



lim jj, n , r = 7r 



l/c 



i 



( [T3*j Theorem 10 or [30J Theorem 2) so that, 
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and from (JT3)l we see that 

lim //* = 7Ty c 
n,i — >oo ' 

Besides, Yin and Krisnaiah scaling is -, so that the limit of the ESD of X n r is the image of 
iTy c by the dilatation D c : x i— > cx i.e. 7rJ (see Sectional). Moreover, the results on extreme 
eigenvalues obtained by Jiang are easily carried. 



7 Proofs of Section SI 

To stress the dependence on n we set for 2 < k < n 

hk,n = R^k ■ 

(and h\ )Tl = 1). The key tool is the cumulant generating function : 

A° k (6) := logEexp {9\ogh Kn ) = logE^JJ . (46) 
From Proposition 12. II 2). we have for 2 < k <n 

<M = * (HZ*±1 + ,) _ < (2Z*±I) _ , (» + ,) + * (») (47) 

where £ = logT. By derivation 



E(,o gft ,,„) = ,(lz*±i)_»(2) 



where \1/ = = r'/T is the digamma function. 

7.1 Proof of Proposition 14.11 

We need the following lemma. 
Lemma 7.1 For every p < n, we have 

n — 1 / n + 1 \ ( n — 2p\ fn 



n — p—1 fn — n — p fn — p + 2\ 
- ^ — — — — (4* 



and 



VarT n>p -2{H n -H n _ p -?-) <4 ^ — . (49) 



A; 2 

k=n— p+1 
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Proof of Lemma 17. It From (fTTj) and (|37jl we get 



p r 



ET„, P = $:(A n G fc )'(0) = 



n — + 1 



*'.2 



From the classical identity 



k + 2 



fc=n-p+l x ' 



and since \I/(l/2) = —7 — 2 log 2, Abbott and Mulders [T] deduced 

^ ^ A + l\ 2- 7 -21og2 

2 12/ 2 V 2 J 2 



It remains to take successively k = n + 1 and = ra — p + 1 and use (|50|). Besides 



VarT„, p = ^(Ag n )"(0) = ^ 



r fn — k + 1 



k=2 k=2 

so that (jjDj) comes from (|57jl and from (J85j) with q = 2. 



n 



(50) 



(51) 



Proof of Proposition I4.ll We have only to prove (JHJ), (fT7|) and (JTIJJ). 

1) We have 

nj" ^1 — — j = p + (n — p) log(n — p) — (n — p) log n . 



for p < n and = n for p = n. Using (|48|) and ()85j) we get ()16|). 

2) Now, for p = [nt\ , < t < 1 and n — > 00 we use the more precise estimate 
We leave the details to the reader. 

3) For the variance, we start from (|4"9*j) and we get easily ()19j) . 
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7.2 Proof of Theorem l4\2l 

Since is uniformly continuous on [0, 1] we have 

[nt\ 



lim sup 

n te[o,i] 



^ 1 



n 



J(l-t) 



0. 



so that, owing to (Ti7jj) . it is enough to prove that a.s. sup 1<p<n | Y P) „, — EY Pj „| = o{n). 
Actually this convergence is a consequence of Borel-Cantelli's lemma, Doob's inequality and 
of the variance estimate Var ^T nj „ = 0(n~ 2 \ogn) coming from (J20|) . ■ 
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7.3 Proof of Theorem HH 

Let us first notice that, thanks to the estimations of expectations in (|T7|) and (jl8j) . we 
can reduce the problem to the centered process A n (t) := T n (t) — ET„(t) and to the centered 
variable A n = A n (l)/ v / logn. 

1) We use the notation of (fTTj) and (fl"9|) . We have A n (t) = Y^k=i Vn,k where 

T] ntk := (log/l fc , n ) - E(l0g h k ,n), k <U 



is a row-wise independent arrow. To prove (|2H|) it is enough to prove the convergence in distri- 
bution in ID([0, T]), for every T < 1, of A n to a centered Gaussian process with independent 
increments, and variance a 2 G . To this purpose we apply a version of the Lindeberg-Levy- 
Lyapunov criteria (see jT^j Theorem 7.4.28 of the french edition, or [21] §3c). For t < 1, 
from (JT§|) it is enough to prove that 

LrctJ 

hm^E« fc )=0. (52) 

fe=l 

We have from (|4T)|) 

E« fc ) = (AgJ (4) (0) + 3[(Ag n )"(0)] 2 . (53) 
On the one hand, from (jUJ), (jEZj) and (JEHJ) for g = 4 

y(A^)( 4 )(0)-48y [- l___i_h <96V- ~ ttj • (54) 

^ v fe,n; w ^L(n-£; + i) 3 n 3 J/ _ ^(n-fc + l) 4 



k=l k=l 



which, for < t < 1 and p = \nt\ yields lim n 5^I=i(A.fcn)^(0) = . On the other hand, 

E[(Agn)"(0)] 2 < (sup(Aj n )"(0)) £(Ag„)"(0) (55) 
fc=i k=i 

and from (fT9j) we get lim n X^i=i(Afcn)"(0) = crg(i). Besides, is non-increasing (see (jEZj) ) 
so that 



bup(AS„) w (0) < 



n — p + 1 



and from (J85J) (again), this term tends to 0. We just checked which proves that the 
sequence of processes (A n (t),t e [0, l)) n converges to a Gaussian centered process W with 
independent increments and variance cr^. So, by (fT7|). 77^ converges to the Gaussian process 
W + with independent increments, drift and variance Oq. 
Finally, equation (|24jl comes from 



T 1 - 2s , „ , x f' 2s 



/ 2(1^)^ ' aG(t) = y r^ ds ' 
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2) When t — 1, most of the sums studied above explode and we need a renormalisation. In 
fact, for every n, the process (A n (t),t G [0, 1]) has independent increments. The conditional 
distribution of A n (l), knowing A n (ti) =£].,••• , A n (t r ) = e r for t x < ■ ■ • < t r is the same as 
£ r + Y^\nt r ]+i Formulae (jmj) and (j201) yield 

n 

E(^J=21ogn + 0(l). 

[nir]+l 

Actually we can apply the Lindeberg's theorem (with the criterion of Lyapunov) to the 
triangular array of random variables rjk, n = Vk,n/ logn with with k = [nt r ] + 1, • ■ ■ , n. It 
is enough to prove 

n 

hm^E(^J=0. (56) 

fc=l 

The route is the same as before, starting from (JH3|) . but now (JH3|) says that the sum 
ELi A i 4 i(°) is bounded. In the sum on the right (with p = n) is now equivalent 

to 2 logn and the supremum (with p = n) is bounded. This yields 

n n 

fe=i fc=i 

which proves (jHEj) . 

Then Yl^nt^+i Vk,n/ V2 logn converges in distribution to W(0, 1), and the same is true for 
the conditional distribution of A n knowing A n (£i) = £i, • • • , A n (t r ) = e r . Since the limiting 
distribution does not depend on ei, ■ • • , s r , we have proved that A n converges in distribution 
to a random variable which is J\f(0, 1) and independent of W. ■ 



7.4 Proof of Theorem 14.41 

As mentioned after the statement of the theorem, we have to prove the LDP for the 
restriction of - T n to [0, T], viewed as an element of A4t, at scale n 2 with rate function 

jg T] (m) := j\ G a (t, *£{t)j dt + j\f (t, ^(t)) Mt) ■ (57) 

Let Vi be the set of functions from [0, T] to 1R which are left continuous and have bounded 
variation, and let V e * be its topological dual when Vi is provided with the uniform convergence 
topology. 

Actually - T n e M T may be identified with an element of (see Appendix B): its 
action on cp G Vi is given by 

1 1 LnTJ /k\ 

< - t n , <p >:= - VW- ) log/i n ,fc • 
n n ' Vn/ 

fc=i 
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The proof is based on the ideas of Baldi's theorem ([T^] p.?). The main tool is the 
normalized cumulant generated function (n.c.g.f.) which here takes the form 

£n,\nt\{^) ■= ^logEexpn < T n ,v > 



Owing to pUj) we have 

[nt\ 



1 / / 



(5* 



and from (jUj) it is finite if ip (~) > - n, ~^ +1 for 1 < < [nT\ and +00 otherwise. 

In Subsection 17.4.11 we prove the convergence of this sequence of n.c.g.f. for a large class 
of functions if. It will be sufficient, jointly to the variational formula given in Subsection l7.4.2l 
to get the upperbound for probability of compact sets. Then Subsection 17.4.31 is devoted to 
exponential tightness, which allow to get the upperbound for closed sets. However, since the 
limiting n.c.g.f. is not defined everywhere, the lowerbound (for open sets) is more delicate 
than in Baldi's theorem. Actually a careful study of exposed points as in [21] is managed in 
Subsection 17.4.41 We end the proof in 17.4.51 

7.4.1 Convergence of the n.c.g.f. 

Lemma 7.2 If ip G Vi satisfies ip(t) > — ^ for every t G (0, T], then 

lim< Ln Tj(v) = Ag,T](v) := / 9{t,¥>{t)) dt , (59) 
n Jo 

where, for 6 > — (1 — t)/2 

g{t, 6) := X - (J7(l - t + 26) - J(l - t) - J{\ + 26)) . (60) 

Proof: The key point is a convergence of Riemann sums. From (J47j) and (J8U|) we have, for 
any > 

O*) = ^^log(l-^ + 2» + i) (61) 



where 







k 












n 


k 

- + 


1 




n 


n 


) 



VlJI log ( 1 — — + — ) — i og (i + 26) + R n , k {6) 



Rn,k(0) = I f(s)e 
is bounded : 



00 

n— k + 2n0 „ n — k „ n-l + 2n8 „ n — 1 

e 2 s — e 2 s — e 2 s + e 2 ■ 







POO 

\Rn,k{Q)\<2 / e-if{s)ds. 
Jo 
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If we set 

2$ n (t) := (1 -t + 2^(t))log(l -t + 2<p(t) + -) 

n 

-(1 - t) log(l -t+-)-(l-- + 2<p(tj) log(l + 2p(t)) 
then, making 9 = ip{k/n) in and adding in fc, we get from (|58|) 

On the one hand, since (p is left continuous, lim n $ n ^-^j = 2g(t, (p(t)) for every t G [0,T]. 
On the other hand the following double inequality holds true: 

2$„(t) > (l-t + 2ip{t))log(l-t + 2ip{t)) - (1 - t) log(2 - t) 

(1 + 2p(i)) log (1 + 2y>(t)) - | log (1 - t + 2<p{t)) | 
2$„(t) < (1 - * + 2p(t)) log (2 - t + 2p(t)) - (1 - t) log(l - t) 
- (1 + 2p(t)) log (1 + 2p(t)) + | log (1 - t + 2<p(t)) | , 

and with our assumptions on tp, these bounds are both integrable. This allows to apply the 
dominated convergence theorem which ends the proof of Lemma 17.21 ■ 



If there exists s < T such that 2tp(s) < — (1 — s) then for n large enough, C nt \nT\ ((f) = +oo 
and we set AS T ,(<£>) = oo. In the other cases we do not know what happens, but as in [21], 
we will study the exposed points. Before, we need another expression of the dual of Aj^ T j. 

7.4.2 Variational formula 

Let us define A^ T ^((p) = +oo if <p does not satisfy the assumption of Lemma 17.21 The 
dual of AS T j is then 

(Ag, T] )* iy) = sup {< v, if > -Ag >T] {<p) } (62) 

for e y/. Mimicking the method of Leonard ([SHI P- 112-113), we get 

( A [o.T]Y iy) = sup {< v,<p> -Ag, T] (^)} (63) 

where C is the set of continuous functions from [0, T] into R vanishing at 0. Then we apply 
Theorem 5 of Rockafellar jH]. We get 
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where 



g*{t,y) = sup{Ay- g(t, A)5(A|(-l/2, oo))} . (64) 

A 



and r is the recession function : 

r(t, y) = lim - — — — . 
Actually, if y < 0, the supremum is achieved for 

H*,y):= -\ (i-^y «*> 

and we have 

g*(t,y) = \{t,y)y-g(t,\{t,y)) 

= l[- y (l-t) + (l-t) log(l - t) + tlogt - tlog(l - e y 

= iH(l-t|e»). (66) 

If y > 0, y) = oo. The recession is now r(£, y) = — |(1 — i)y if y < 0, and = oo si y > 0. 
As a result 

g*{t,y) = L G a {t,y) , r(t, y) = L?(t, y) . (67) 
So we proved the identification ^A^ T ^j = I^ T j (recall (|57jl). 

7.4.3 Exponential tightness 

If V^* is provided with the topology a(Vl,Ve), the set £> a := {/j 6 V/ : M[o,t] < a} 
is compact according to the Banach-Alaoglu theorem. But -T n is a positive measure and 
if n ([0, T]) = ±T n (T) has a n.c.g.f. given for 6 > by 

£<r(0) := ^logEexp{n0T n (T)} = £„, LnT j (01 [O , T ]) 

l-T 



For 9 > let 



r-T 

Lr(0) := / flr(t, (6* 



o 



Lemma [7.21 savs that for fixed 9 > 



l-T 
2 



lim£„, LnT j(0J [o ,T]) = M0) (69) 

so that 

limsup^rlogP ( -T„ S J < limsup^rlogP(T n (T) > no) < -ad + L T (6) , 
n n z \n J n n z 

which proves the exponential tightness, letting a — > oo. 
Let us notice that it was not possible to take T — 1. 
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7.4.4 Exposed points 

Let 1Z be the set of functions from [0, T] into R which are positive, continuous and with 
bounded variation. Let T be the set of those m G V £ * (identified with M.t as in [35] ) which 
are absolutely continuous and whose density p is such that — p G 71. Let us prove that such 
amis exposed, with exposing hyperplane f m (t) = \(t,p(t)) (recall (JBSJ)). Actually we follow 
the method of [21]. For fixed t, g*(t, .) is strictly convex on (— oo, 0) so that, if z ^ pit), we 
have 

g*(t,p(t)) - g*(t,z) < X(t,p(t))(p(t) - z) . 

Let = l(t)dt + the Lebesgue decomposition of some element £ G .Mr such that 
^[ot](0 < 00 • Taking z = l(t) and integrating, we get 

T />T />T /-T 

g*(t,p(t)dt- g*(t,l(t))dt < A(t, p{t))p(t)dt - X(t, p(t))l(t) dt 
Jo Jo Jo 

and since f Q g*(t,l(t))dt = J Q T L%(tJ(t))dt < ?|ot](0 tms yields 

^[o,T]( m ) - ^o,t](0 < / fmdm- / / m df. 

JO JO 

Now let us prove that this set of exposed points is rich enough. We have the following lemma. 



Lemma 7.3 Let in G V r such that I^ T ^(m) < oo. There exists a sequence of functions 
l n G 1Z such that 

1. \im n l n (t)dt = —m in V e * with the a(V/,Ve) topology 

2. \im n I^ T] (-l n (t)dt) = T§ iT] (m) . 

Proof: The method may be found in [21] and in [TJJ]. The only difference is in the topology 
because we want to recover marginals. We will use the basic inequality which holds for every 
e < : 

L°(t,v + e)<L°(t,v)-^(l-t) (70) 

Let m = m a + m s such that l2 T ,(m) < oo. From and (ffilj) it is clear that — m a and 
— m s must be positive measures. 

First step We assume that m = —l(t)dt — rj with I G -^ 1 ([0, T]; dt) and rj a singular posi- 
tive measure. One can find a sequence of non negative continuous functions h n such that 
h n (t)dt — > rj for the topology cr(V^, Vi). Indeed every function ip G Vi may be written as a 
difference ipi — ip2 of two increasing functions. There exists a unique (positive) measure ot\ 
such that ipi(t) = ai([t,T\) for every t G [0,T]. Moreover, the function g = T]([0, ■]) G V r is 
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non decreasing and may be approached by a sequence of continuously derivable and non de- 
creasing functions (g n ) such that g n < g. Setting h n := g' n and v n = h n (t)dt, the dominated 
convergence theorem gives 

<ij) 1 ,v n >= [ i/ n ([0,t])ai(dt) -> f r?([0,t])o;i(dt). 
Jo Jo 

With the same result for ^ 2 we get 

<il>,v n > = [ i/ n ([0,t])ai(dt)- / i/ n ([0,t])a 2 (tft) 
Jo Jo 

7j([0, - / J7([0,t])a 2 (dO- 

'0 Jo 

or lim n < ip, v n >=< ip, r\ >. On the one hand, the lower semicontinuity of 19. rl yields 



liminf /g T] ( - (l(t) + h n {t))dt) > /g T] (m) 



On the other hand, integrating (J7UJ) yields 



lfp,T\(-{Kt) + hn(t))M) < I L a&-K-t))dt + l f (l-t)h n (t)dt 

Jo * Jo 



jf l£(t,-l(t))dt + ±J (1 - t) V (dt) = I^ T] (m) . 



Second step Let us assume that m = —l(t)dt with I G L 1 ([0, T]; cit) and for every n, let 
us set l n = max(Z,l/n). It is clear that as n — > oo, then / n J /. On the one hand the 
lower semicontinuity gives liminf a I^ T ^(—l n (t)dt) > l£ T i(—l(t)dt). On the other hand, by 
integration of inequality (fTHJ) , since l n — I < 1/n 

I§,T]Hn(t)dt) < lg T] (-l(t)dt) + . 

It is then possible to reduce the problem to the case of functions bounded below. 

Third step Let us assume that m = —l(t)dt with / G L 1 ([0, T]; <it) and bounded below by 
A > 0. One can find a sequence of continuous functions (h n ) with bounded variation such 
that h n > A/2 for every n and such that h n — > I a.e. and in -^ 1 ([0, T], dt). We have 
h n (t)dt — > Z (t ) c?t in cr(V^*, 14) an d since L^(t, •) is uniformly Lipschitz on ] — oo, —A/2], say 
with constant k, we get 

l^0>](-^n(0^) -T§,T\(-Kt)dt)\ < K I \h n (t) ~ l(t)\dt ->0. 

JO 

Actually, /i n G 7£ and <p n (0 := —h n (t)) satisfies the assumption of Lemma \7~2l since 

1 + 2^) - t >—L—. 
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7.4.5 End of the proof of Theorem 14.41 



• First step: upperbound for compact sets. We use th. 4.5.3 b) in and the following 
lemma. 



Lemma 7.4 For every 5 > and m G , there exists ip$ fulfilling conditions of \7.%\ and 
such that 

I (psdm — A^f(ips) > min ^(m) — 5, <5 _1 . (71) 
Jo L J 

• Second step: upperbound for closed sets. We use the exponential tightness. 

• Third step: lowerbound for open sets. The method is classical (see [TH] th. 4.5.20 c)), 
owing to Lemma [7.31 

To prove Lemma l7.4[ we start from the definition fl62(l or (JBHJ). One can find (fs G Ve 
satisfying (fTTj) . If <ps does not check assumptions of the lemma we add e > to <ps which 
allows to check them and satisfy (J7TJ) up to a change of S. ■ 



7.5 Proof of Theorem 14.51 

We use the contraction from the LDP for paths. Since the mapping m i— > m([0, T]) is 
continuous from D to H, the family {T n i n rj}n satisfies the LDP with good rate function 
given by (JUJ): 

inf{lg T] (<;) ; v(T) = £} . 

Fixing £, we can look for optimal t>. Let G (—(1 — T)/2, oo] (playing the role of a Lagrange 
multiplier) . 

By the duality property (JMj) 

Integrating and using (J?T7|) . (Jfi7)l and (jHEJ) we get 

/g T] (^) > W.([0,r]) - Lt{@) - \ J\l - t) dv s (t) , (72) 
For every v such that v(T) = £ it turns out that 

I§,t\( v ) > B Z ~ L M ~ \ I (l-t + 20) dv s {t) >di- L T {9) . (73) 

^ Jo 

Besides, from flfiftj) the ordinary differential equation 

A(t,0'(t)) = 
0(0) = 0, 
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admits for unique solution in C 1 ([0, T]) 

t ^ <p(0- t) := J (I + 29) -J(l-t + 29)-t log(l + 26) . 

The mapping 6 i— > <p(9;T) has a positive derivative and its limit as 6 | — (1 — T)/2 is — T. 
Moreover, by duality 

g* (t,^(e,t)j = 9^(9,t)-g(t,9). 

There are two cases. 

• If £ > — T, there exists a unique 9% such that <p(9^,T) = £ (i.e. the relation (fHU|) is 
satisfied). For := , •), we get from (J5*7|) . (|H7jl and (ffiHJ) again 

so that realizes the infimum in ([29)1 . A simple computation ends the proof of the first 
statement of Theorem 14.51 

Let us notice that at the end point £ = — T, we have 

Oi = , vHt) = J(T) - J(T — t)— ilogT , (^)'(t) = log(l - f/T) . 

Finally 

J^(-T) = T(l-T) + i(F(l)-F(l-T)-F(T)+T 2 logT) 

T(l — T) T 2 logT _ (1 — T) 2 log(l — T) 3 
4 + 4 4 + 8' 

• Let us assume £ = — T — e with e > 0. Plugging 9 = in (fTHJ) yields, for every t> 
such that v(T) = £ 

I?o, T] (v) > " ^ (-^) = ^ + I T G (-T) 

Moreover this lower bound is achieved by the measure v = (v~ T )'(t)dt — eSxit), since 

j\a(t,(v- T )'{t))dt = I?(T) , £^edSr(t) = ^le. 
That ends the proof of the second statement of Theorem 14.51 ■ 

Remark 7.5 It is possible to try a direct method to get hSl\). \32\) using Gartner-Ellis' 
theorem flldj- Theorem 2.3.6). From \69i) the limiting n.c.g.f. of Tn ^ is which is 
analytic for 9 > —^y~- When 9 I we have L' T (9) J, —T. We meet a case of so 

called non steepness. To proceed in that direction we could use the method of time dependent 
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change of probability (see JT$). We will not give details here. Nevertheless, this approach 
allows to get one-sided large deviations in the critical case T — 1. Actually we get 

lim^logP(T n , n > nx) = -if (a;) 

for x > —1. The value x — — 1 corresponds to the limit ofT n ^ n /n. Notice that the second 
(right) derivative of If at this point is zero (or equivalently lim L 1 (8) = oo as 9 j 0) , which 
is consistent with previous results on variance. I do not know the rate of convergence to of 
P(T ni „ < nx) for x < — 1. 



8 Proofs of Section 5 



8.1 Proofs of Subsection 15.21 

We use the decomposition (JT3J). We need only to notice that 



hence 



log E exp 9 log 



r(f) 



/? 



9 + 



n 



9 log 



n 



(74) 



which provides estimates for the expectation and the variance. Differentiating once and 
taking 9 = 0, we see that 



Elog 



l&il 



n 



n 



log 



n 



2/J 



sec 



which gives 



sup 

p<n 



1 

n 



oo i 

e~ s ^sf(s) ds = 

n 



oi 1 - 



O 



1lr 



(75) 



Besides, differentiating (|71jl twice and taking 9 = again, we have 



Var ( log 



n 



?? 



2 - + o' 1 



n 



(see ()88jl). which yields 



sup 

p<n 



Var S, 



Tl.p 



2p 

n 



O 



n 



(76) 



From (|75|l and ()76|) it is easy to check (via a fourth moment estimate) that S n converges in 
distribution in E>([0, 1]) to f-t + V2B t , t e [0, 1]), where B is a Brownian motion indepen- 
dent of the ex-field generated by (T n , n G IN). Finally the family of processes = rf^ + S n 
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converges in distribution towards (Y t — 1 + v^B t , t e [0, 1)). It is a Gaussian process, whose 
drift and variance are 

Mt) = d G (t) ~t=- log(l - f) , a^(t) = + 2t = -2 log(l - t) . 

which identify the process X. 
Besides, we have 



ea)=^(i) + 



s n {i) 



V21ogn 

so that the convergence of rf% (1) is clear. Moreover the independence properties seen in 
Section l4~21 remain true. 

8.2 Proofs of Subsection GT3] 

8.2.1 Proof of Lemma H31 

It is a route similar to the proof of Theorem 14.41 in Section EU (see also jlOj)- We start 
from (1741) : 



//- I - J log 2 + i (wy ( - ) + " 



logEexp < nS n ,'y >= 

fe=i 

if 7(s) + | > for every s e [0, 1]. The limiting n.c.g.f. is 

C s { 1 ) = l -j^J{l + 2 1 {t))dt, 
which yields (jSSjl by duality (see jH] again). 



7/ 



(77) 



8.2.2 Proof of Theorem 1531 

We deduce from Lemma f5 .41 and Theorem 14.41 that the sum iT n + -S n satisfies a LDP 
at the same scale with good rate function obtained by inf-convolution of i^yi and lf QT y To 
obtain (jHTJ and (jSHJ), it is possible to compute explicitely this inf-convolution: 

= M{L G a {v)+L s a {u-v)} 



w 



M{Lf(v)+L b s (u-v)}. 



Alternatively, it is possible to sum the two n.c.g.f. (()59|) and (|77j) ) and get the rate function 
by duality. ■ 



Remark 8.1 We can make the same comments as in Remark \7.5\ In particular, we get 

lim — log P(S n .„ > nx) = -If (x) (78) 

for x > — 1. This boundary point corresponds to the limit ofE njn /n. The second derivative 
is vanishing at this point, which is consistent with the results on variance. 
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8.2.3 Proof of Proposition I57H 

Let 9 be a Lagrangian factor. We begin by minimizing 

->2 



J* p (/i) - logx = y + 2 J 



B(T) 



where 



= Aa; - slogx, A = — , s = — . 

In [12] P-43 example 5.4, it is stated that for A > and 2s + 1 > fixed, the infimum 



(79) 



inf + 2 J q x ,s(x) dji(x) 



is achieved on the unique extremal measure tt^ 2 with 



a 



2s + 1 
2A 



2s + 1 



which yields, from (J79j) : 



a 2 = 1 + 29, c 



T 



a" 



Now it remains to look for 9 such that the constraint \x G A(£/T) is saturated. Since 

J \ogx dn^ 2 (x) = logo -2 + J logx dii1(x)dx , 



and thanks to (j22j) we see that 9 must satisfy 

J(l + 29) - J {I - T + 29) 



f = Tioga 2 -T- J(1 r) 



which is exactly exactly (|39|). 

To compute 7j . (7r£ 2 ), we start from the definition (J¥Tj): 

4 sp) « 2 ) = - y s « 2 ) + 1 - C 1 - T ) d <^ + 5 ( T ) > 

and transform 7r^ 2 to 7rJ using the dilatation. In particular, (J42j) yields E(7r^ 2 ) = log a 2 +E(7r£) 
and S(vrJ) may be picked from formula (13) p. 10 in j2H] : 

S(7r 1 c ) = -l + i(c- 1 + logc +(c- 1 -l) 2 log(l-c)) . 

Besides we have easily J x dn^x) = 1. After some tedious but elementary computations we 
get exactly the expression (JJUj). ■ 
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9 Appendix : Some properties of £ = log T and ^ 

From the Binet's formula ([2 j [20] P-21), we have 

1 Z" 00 

£(a;) = ( x - )\ gx-x + l+ f(s)[e~ sx -e~ s ]ds (80) 

2 Jo 

= (x-^)\ogx-x + ^\og(2n) + J™ f(s)e- sx ds . (81) 
where the function / is defined by 



/(«) = 

and satisfies for every s > 



1 1 1 
- - + 



2 s e s - 1 



oo 

- = 2 V - 

s ^— ' s 2 + 47r 2 A; 2 
fc=i 



(82) 



< f{s) < /(0) = 1/12 , < (sf(s) + \)<1. (83) 



4) 



By differentiation 

j poo pOO j 

$(x)=\ogx-- / sf(s)e~ sx ds = logx - / e~ sx (sf(s) + -)ds , 

2s Jo Jo 2 

As easy consequences, we have, for every x > 

< x(logx-V(x)) < 1, (85) 

and 



0<x 2 logx-^(x) • ( 86 ) 

Differentiating again we see that for q > 1 

/•OO 1 

= (-l)*- 1 ^ - * + (-I) 9 " 1 / e- sz s q (sf(s) + -) rfs (87) 

Jo 2 

and then 

|^ (<J) (^) - (-l) g -V^| < . (88) 
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